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1 Introduction 



There are many important models that give rise to conditional moment restrictions. These 
restrictions often take the form 

E[/5(y,X,ao)|t^] =0, 

where p{Y, X, a) has a known functional form but oq is unknown. Parametric models (i.e., mod- 
els when ao is finite dimensional) of this form are well known from the work of Hansen (1982), 
Chamberlain (1987), and others. Nonparametric versions (i.e., models when oq is infinite di- 
mensional) are motivated by the desire to relax functional form restrictions. Identification and 
estimation of linear nonparametric conditional moment models have been studied by Newey 
and Powell (2003), HaU and Horowitz (2005), Blundeh, Chen, and Kristensen (2007), Darolles, 
Fan, Florens, and Renault (2011), and others. 

The purpose of this paper is to derive identification conditions for ao when p may be 
nonlinear in a and for other nonlinear nonparametric models. Nonlinear models are important. 
They include models with conditional quantile restrictions, as discussed in Chernozhukov and 
Hansen (2005) and Chernozhukov, Imbens, and Newey (2007), and various economic structural 
and semiparametric models, as further discussed below. In this paper we focus on conditions 
for local identification of these models. It may be possible to extend these results to provide 
global identification conditions. 

In parametric models there are easily interpretable rank conditions for local identification, as 
shown in Fisher (1966) and Rothenberg (1971). We give a pair of conditions that are sufficient 
for parametric local identification from solving a set of equations. They are a) pointwise differ- 
entiability at the true value, and b) the rank of the derivative matrix is equal to the dimension 
of the parameter oq. We find that the nonparametric case is different. Differentiability and the 
nonparametric version of the rank condition may not be sufficient for local identification. We 
suggest a restriction on the neighborhood that does give local identification, via a link between 
curvature and an identification set. We also give more primitive conditions for Hilbert spaces, 
that include interesting econometric examples. In addition we consider semiparametric mod- 
els, providing conditions for identification of a finite dimensional Euclidean parameter. These 
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conditions are based on "partialling out" the nonparametric part and allow for identification of 
the parametric part even when the nonparametric part is not identified. 

The usefulness of these conditions is illustrated by three examples. One example gives 
primitive conditions for local identification of the nonparametric endogenous quantile models 
of Chernozhukov and Hansen (2005) and Chernozhukov, Imbens, and Newey (2007), where 
primitive identification conditions had only been given previously for discrete regressors. An- 
other example gives conditions for local identification of a semiparametric index model with 
endogeneity. There we give conditions for identification of parametric components when non- 
parametric components are not identified. The third example gives sufficient conditions for 
local identification of a semiparametric consumption capital asset pricing model. 

In relation to previous literature, in some cases the nonparametric rank condition is a 
local version of identification conditions for linear conditional moment restriction models that 
were considered in Newey and Powell (2003). Chernozhukov, Imbens, and Newey (2007) also 
suggested differentiability and a rank condition for local identification but did not recognize 
the need for additional restrictions on the neighborhood. Florens and Sbai (2010) recently 
gave local identification conditions for games but their conditions do not apply to the kind of 
conditional moment restrictions that arise in instrumental variable settings and are a primary 
subject of this paper. 

Section 2 presents general nonparametric local identification results and relates them to 
sufficient conditions for identification in parametric models. Section 3 gives more primitive 
conditions for Hilbert spaces and applies them to the nonparametric endogenous quantile model. 
Section 4 provides conditions for identification in semiparametric models and applies these to 
the endogenous index model. Section 5 discusses the semiparametric asset pricing example and 
Section 6 briefly concludes. The Appendix contains additional lemmas and all the proofs. 

2 Nonparametric Models 

To help explain the nonparametric results and give them context we give a brief description of 
sufficient conditions for local identification in parametric models. Let a be a p x 1 vector of 
parameters and m{a) a J x 1 vector of functions with m(ao) = for the true value oq. Also 
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let |-| denote the Euclidean norm in either or M depending on the context. We say that ao 
is locally identified if there is a neighborhood of ao such that m{a) ^ for all a ^ ao in the 
neighborhood. Let ni' denote the derivative of m(a) at oq when it exists. Sufficient conditions 
for local identification can be stated as follows: 

If m(a) is dijferentiable at oq c-'iT'd rank[m') = p then ao is locally identified. 

This statement is proved in the Appendix. Here the sufficient conditions for parametric local 
identification are pointwise differentiability at the true value ao and the rank of the derivative 
equal to the number of parameters. 

In order to extend these conditions to the nonparametric case we need to modify the notation 
and introduce structure for infinite dimensional spaces. Let a denote a function with true 
value ao and m(a) a function of a, with m(ao) = 0. Conditional moment restrictions are an 
important example where p{Y, X, a) is a finite dimensional residual vector depending on an 
unknown function a and m(a) = E[/9(y, X, a)\W]. We impose some mathematical structure by 
assuming that a G a Banach space with norm |H|^ and m(a) £ B, a Banach space with a 
norm ||-||g, i.e. m : A>-^ B. The restriction of the model is that ||m'(ao)||g = 0. The notion of 
local identification we consider is: 

Definition: ao is locally identified on J\f C A if \\m{a)\\jg > for all a £ M with a ^ ao- 

This local identification concept is more general than the one introduced by Chernozhukov, 
Imbens and Newey (2007). Note that local identification is defined on a set N in A. Often 
there exists an e > such that is a subset of an open ball 

A^e = {a G ^ : ||a — ao||^ < e}- 

It turns out that it may be necessary for M to be strictly smaller than an open ball A4 in A, 
as discussed below. 

The nonparametric version of the derivative will be a continuous linear map m' : A ^ B. 
Under the conditions we give, m' will be a Gateaux derivative at ao, that can be calculated as 

d 

m'h = —m{ao + th)\t=o (2.1) 
[3] 



for h & A and t a scalar. Sometimes we also require that for any 5 > there is e > with 

\\m{a) — m{ao) — m'(a — ao)\\p, 

n n ^ 

II" - "olU 

for all a G A^. This is Frechet differentiability of m(a) at ag (which implies that the linear map 
m' : A B is continuous). Frechet differentiability of estimators that are functionals of the 
empirical distribution is known to be too strong, but is typically satisfied in local identification 
analysis, as shown by our examples. 

In parametric models the rank condition is equivalent to the null space of the derivative 
matrix being zero. The analogous nonparametric condition is that the null space of the linear 
map m' is zero, as follows: 

Assumption 1 (Rank Condition): There is a set M' such that ||m'(a — ao)|le > for 
all a gA/"' with q / oq. 

This condition is familiar from identification of a linear conditional moment model where 
Y = ao{X) + U and E[U\W] = 0. Let p{Y,X,a) = Y - a(X). Here m{a) = E[Y - a{X)\W] 
so that m'h = —E[h{X)\W]. In this case Assumption 1 requires that E[a{X) — aQ{X)\W] / 
for any a £j\f' with a — oq ^ 0. For J\f' = A this is the completeness condition discussed in 
Newey and Powell (2003). Andrews (2011) has recently shown that if X and W are continuously 
distributed, there are at least as many instruments in W as regressors in X, and the conditional 
distribution of X given W is unrestricted (except for a mild regularity condition), then the 
completeness condition holds generically, in a sense defined in that paper. In Section 3 we also 
give a genericity result for a different range of models. For this reason we think of Assumption 
1 with A/"' = ^ as a weak condition when there are as many continuous instruments W as the 
endogenous regressors X, just as it is in a parametric linear instrumental variables model with 
unrestricted reduced form. It is also an even weaker condition if some conditions are imposed 
on the deviations, so in the statement of Assumption 1 we allow it to hold only on J\f' C A. 
For example, if we restrict a — oq to be a bounded function of X, then in linear conditional 
moment restriction models Assumption 1 only requires that the conditional distribution of X 
given W be bounded complete, which is known to hold for even more distributions than does 
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completeness. This makes Assumption 1 even more plausible in models where ao is restricted to 
be bounded, such as in Blundell, Chen and Kristensen (2007). See, for example, Mattner (1993), 
Chernozhukov and Hansen (2005), D'Haultfoeuille (2011), and Andrews (2011) for discussions 
of completeness and bounded completeness. 

Prechet differentiability and the rank condition are not sufficient for local identification in 
an open ball Me around ao, as we further explain below. One condition that can be added to 
obtain local identification is that m' : A'^ B \s onto. 

Theorem 1: If m[a) is Frechet dijjerentiable at ao, the rank condition is satisfied on N' 
= Ne for some e > 0, and m' : B is onto, then uq is locally identified on for some £ 

with < £ < £. 

This result extends previous nonparametric local identification results by only requiring 
pointwise Frechet differentiability at ao, rather than continuous Frechet differentiability in a 
neighborhood of ao- This extension may be helpful for showing local identification in nonpara- 
metric models, because conditions for pointwise Frechet differentiability are simpler than for 
continuous differentiability in nonparametric models. 

Unfortunately, the assumption that m' is onto is too strong for many econometric models, 
including many nonparametric conditional moment restrictions. An onto m' implies that m' has 
a continuous inverse, by the Banach Inverse Theorem (Luenberger, 1969, p. 149). The inverse 
of m' is may not be continuous for nonparametric conditional moment restrictions, as discussed 
in Newey and Powell (2003). Indeed, the discontinuity of the inverse of m' is a now well known 
ill-posed inverse problem that has received much attention in the econometrics literature, e.g. 
see the survey of Carrasco, Florens, and Renault (2007). Thus, in many important econometric 
models Theorem 1 cannot be applied to obtain local identification. 

It turns out that ao may not be locally identified on any open ball in ill-posed inverse 
problems, as we show in an example below. The problem is that for infinite dimensional spaces 
m'(a — ao) may be small when a — ao is large. Consequently, the effect of nonlinearity, that 
is related to the size of a — ao, may overwhelm the identifying effect of nonzero m'{a — ao), 
resulting in m(a) being zero for a close to ao- 
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We approach this problem by restricting the deviations a — to be small when m'(a — oq) 
is small. The restrictions on the deviations will be related to the nonlinearity of m(a) via the 
following condition: 

Assumption 2: There are L > 0,r > 1 and a set N" such that for all a £ M", 
\\m{a) — m{ao) — m'{a — ao)||g < L\\a — ao||^ ■ 

This condition is general. It includes the linear case where M" = A and L = 0. It also 
includes Frechet differentiability, where r = 1, L is any positive number and N" = A4 for any 
sufficiently small e > 0. Cases with r > 1, are analogous to Holder continuity of the derivative in 
finite dimensional spaces, with r = 2 corresponding to twice continuous Frechet differentiability. 
We would only have r > 2 when the second derivative is zero. This condition applies to many 
interesting examples, as we will show in the rest of the paper. The term L ||a — ao||^ represents 
a magnitude of nonlinearity that is allowed for a gM" . The following result uses Assumption 
2 to specify restrictions on a that are sufficient for local identification. 

Theorem 2: If Assumption 2 is satisfied then oq is locally identified on M = M" CiAf'" 
with M'" ={a : \\m'{a — ao)!!^ > L\\a — ao||^}- 

The strict inequality in M'" is important for the result. It does exclude ao from A^, but that 
works because local identification specifies what happens when a ^ aQ. This result includes 
the linear case, where L = 0, J\f" = A, and J\f = J\f"' = J\f'. It also includes nonlinear cases 
where only Frechet differentiability is imposed, with r = 1 and L equal to any positive constant. 
In that case M" = Me for some e small enough and a G M'" restricts a — ao to a set where 
the inverse of m' is continuous by requiring that ||m'(a — ao)!!^ > ||o — ooll^- In general, 
by L||a — aoll^ ^ 0) we have M'" C J\f' for M' from Assumption 1, so the rank condition 
is imposed by restricting attention to the M of Theorem 2. Here the rank condition is still 
important, since if it is not satisfied on some interesting set A/"', Theorem 2 cannot give local 
identification on an interesting set M. 

Theorem 2 forges a link between the curvature of m(a) as in Assumption 2 and the identifi- 
cation set M. An example is a scalar a and twice continuously differentiable m(a) with bounded 
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second derivative. Here Assumption 2 will be satisfied with r = 2, L = sup^, \(Pm{a)/da'^\ /2, 
and M" equal to the real line, where |-| denotes the absolute value. Assumption 1 will 
be satisfied with M' equal to the real line as long as m' = dm{aQ)/da is nonzero. Then 
A/"'" = {a : \a — ao\ < I'ti'I}. Here \m'\ is the minimum distance a must go from oq 
before m(a) can "bend back" to zero. In nonparametric models M'" will be an analogous set. 

When r = 1 the set M'" will be a linear cone with vertex at ao, which means that if a G M'" 
then so is Xa + (1 — X)ao for A > 0. In general, M'" is not convex, so it is not a convex cone. 
For r > 1 the set J\f"' is not a cone although it is star shaped around ao, meaning that for any 
a G M'" we have Aa + (1 - A)ao G M'" for < A < 1. 

Also, if r > 1 then for any L > and 1 < r' < r there is 5 > such that 

J\fsn{a : \\m'{a - ao)||g > L\\a- aoll^} ^ Ms<^N"'. 

In this sense a G N'" as assumed in Theorem 2 is less restrictive the larger is r, i.e. the local 
identification neighborhoods of Theorem 2 are "richer" the larger is r. 

Restricting the set of a to be smaller than an open ball can be necessary for local identifica- 
tion in nonparametric models, as we now show in an example. Suppose a = (ai, 02, ■••) is a se- 
quence of real numbers. Let (pi,p2> •••) be probabilities, pj > 0, Yl^i Pj = ^- Let f{x) be a twice 
continuously differentiable function of a scalar x that is bounded with bounded second deriva- 
tive. Suppose f{x) = if and only if x G {0, 1} and df{0)/dx = 1. Let m{a) = (/(ai), /(a2)) •••) 
also be a sequence with ||m(a)||0 = ( • Then for HaH^^ = iYl'jLiPj'^j) 
the function m{a) will be Frechet differentiable at ao = 0, with m'h = h. A fourth moment 
norm for a, rather than a second moment norm, is needed to make m(a) Frechet differentiable 
under the second moment norm for m{a). Here the map m' is not onto, even though it is the 
identity, because the norm on A is stronger than the norm on B. 

In this example the value ao = is not locally identified by the equation m{a) = on 
any open ball in the norm \\a — ao||_4- To show this result consider which has zeros in the 
first k positions and a one everywhere else, i.e., a = (0, 0, 1, 1, ...). Then m{a'') = and for 

= ET=k+iPj we have \\a'' - ao|U = (Ej°1i = {^'f^^ ^ 0. Thus, we 
have constructed a sequence of a'^ not equal to ao such that m(a'^') = and ||a^ — Q!o||_4 — > 0. 
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We can easily describe the set N of Theorem 2 in this example, on which = will be 
locally identified. By the second derivative of / being bounded, Assumption 2 is satisfied with 
J\[" = ^, r = 2, and L = sup„ \d'^ f{a) / da^\/2, where L > 1 by the fact that /'(O) = 1 and 
/(O) = /(I) = (an expansion gives = /(I) = 1 + d"^ f (a) / da'^ for < a < 1). Then, 
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The sequence (a'^)^]^ given above will not be included in this set because L > 1. A subset of 
M (on which ao is locally identified) is {a = {ai,a2,---) : \aj\ < L~^,{j = 1,2,...)}. 

It is important to note that Theorems 1 and 2 provide sufficient, and not necessary, condi- 
tions for local identification. The conditions of Theorems 1 and 2 are sufficient for 

||m(a) — m'{a — ao)||g / ||?n'(a — ao)!!^ (2-2) 

that implies m(a) ^ 0, to hold on M. The set where (|2.2p may be larger than the M of Theorem 
1 or 2. We have focused on the M of Theorem 1 or 2 because those conditions and the associated 
locally identified set M are relatively easy to interpret. 

Assumption 1 may not be needed for identification in nonlinear models, although local 
identification is complicated in the absence of Assumption 1. Conditions may involve nonzero 
higher order derivatives. Such results for parametric models are discussed by, e.g., Sargan 
(1983). Here we focus on models where Assumption 1 is satisfied. 

3 Local Identification in Hilbert Spaces 

The restrictions imposed on a in Theorem 2 are not very transparent. In Hilbert spaces it is 
possible to give more interpretable conditions based on a lower bound for ||?Ti'(a — ao)||g. Let 
(•, •) denote the inner product for a Hilbert space. 

Assumption 3: {A, CLnd {B, ||-||g) are separable Hilbert spaces and either a) there is 

a set M', an orthonormal basis {(f)i,(p2, ■■■} ^ A, and a bounded, positive sequence (/ii, ;U2, •••) 
such that for all a £ Af', 

oo 

2 ^ ,,2/„, „, i, \2. 



I {a - ao)||g > ^l^]{a - ao, 
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or b) m' is a compact linear operator with positive singular values (/Ui,/i2, •••)• 

The hypothesis in b) that m' is a compact operator is a mild one when m' is a conditional 
expectation. Recall that an operator m : A B is compact if and only if it is continuous 
and maps bounded sets in A into relatively compact sets in B. Under very mild conditions, 
m(a) = E[a{X)\W] is compact: See Zimmer (1990, chapter 3), Kress (1999, section 2.4) and 
Carrasco, Florens, and Renault (2007) for a variety of sufficient conditions. When m' in b) is 
compact there is an orthonormal basis : j = 1, . . .} for A with 

oo 

\\m'{a - ao)||g = - c^oAj)'^, 

i=i 

where /i^ are the eigenvalues and the eigenfunctions of the operator m'*m', so that condition 
a) is satisfied, where m'* denotes the adjoint of m'. The assumption that the singular values 
are all positive implies the rank condition for J\f' = A. Part a) differs from part b) by imposing 
a lower bound on ||m'(a — ao)||g only over a subset M' of A and by allowing the basis {(pj} to 
be different from the eigenfunction basis of the operator m'*m' . In principle this allows us to 
impose restrictions on a — hke boundedness and smoothness, which could help Assumption 
3 a) to hold. For similar assumptions in estimation context, see, e.g., Chen and ReiB (2011) 
and Chen and Pouzo (2012). 

It turns out that there is a precise sense in which the rank condition is satisfied for most 
data generating processes, if it is satisfied for one, in the Hilbert space environment here. In 
this sense the rank condition turns out to be generic. Let A and B be separable Hilbert spaces, 
and J\f' C A. Suppose that there exists at least one compact linear operator: K, : A^ B which 
is injective , i.e. ]C5 = for 6 ^ A \i and only if = 0. This is an infinite-dimensional analog 
of the order condition, that for example rules out B having smaller finite dimension than A 
(e.g. having fewer instruments than right-hand side endogenous variables in a linear regression 
model). The operator m' : M' ^ B \s generated by nature as follows: 

1. The nature selects a countable orthonormal basis {ipj} of cardinality < oo in ^ and an 
orthonormal set {'^j} of equal cardinality in B. 
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2. The nature samples a bounded sequence of real numbers {Xj} according to a probability 
measure r] whose each marginal is dominated by the Lebesgue measure on M, namely 
heh{A) = implies rj{{Xj G A}) = for any measurable ^4 C M for each j. 

Then nature sets, for some scalar number k > 0, and every 6 G J\f' 

N 

m'6 = KC^Xj{(j)j,6)ipj). (3.3) 
j=o 

This operator is properly defined on Af' := {6 G A : m'6 £ B}. 

Lemma 3 (1) In the absence of further restrictions on m' , the algorithms obeying conditions 
1 and 2 exist. (2) If m' is generated by any algorithm that obeys conditions 1 and 2, then 
probability that m' is not injective over N' is zero, namely Pr^{3(^ G M' : 6 ^ and m'6 = 
0} = 0. Moreover, Assumption 3 holds with fij = \K.Xj\ with probability one under rj. 

In Appendix B we provide examples for the case A = B = L'^[0,l] that highhght the range 
of algorithms permitted by conditions 1 and 2 above, including cases where various restric- 
tions on m' are imposed: boundedness, compactness, weak positivity, and density restrictions. 
Genericity arguments use the idea of randomization, and are often employed in economic the- 
ory, functional analysis, and probability theory, see, e.g., Anderson and Zame (2000), Marcus 
and Pisier (1981), Ledoux and Talagrand (2011). Andrews (2011) previously used a related no- 
tion of genericity, called prevalence within bounded sets, to argue that rich classes of operators 
induced by densities in nonparametric IV are L^-complete. The result above uses a somewhat 
different notion of genericity than prevalence. We also note that while this construction implies 
identification with probability one, it does not regulate in any way the strength of identification, 
and hence has no bearing on the choice of an inferential method. 

In what follows let bj = {a — ao, 4>j),j = 1, 2, ... denote the Fourier coefficients for a — ao, 
so that a = uq + Xljli ^j'Pj- Under Assumptions 2 and 3 we can characterize an identified set 
in terms of the Fourier coefficients. 

Theorem 4: If Assumptions 2 and 3 are satisfied then oq is locally identified on Af = Af" Ci 
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N'", where 

oo oo oo 

When r = 1 it is necessary for a E AA'" that the Fourier coefficients bj where fij is small not 
be too large relative to the Fourier coefficients where /i^ is large. In particular, when r = 1 any 
Q 7^ Qo with bj = for all j with fij > L will not be an element of M'". When r > I we can use 
the Holder inequality to obtain a sufficient condition for a E M'" that is easier to interpret. 

Corollary 5: If Assumptions 2 and 3 are satisfied, with L > 0,r > 1, then oq is locally 
identified on M = M"r\M"' where M'" = {a = ao + Y.f=i h^j ■ ET=i l^j^^^'^'^^^'j < 

For a to be in the J\f"' of Corollary 5 the Fourier coefficients bj must vanish faster than 
^ grows. In particular, a sufficient condition for a E J\f"' is that \bj\ < {fij/L)^^^'^~^^ Cj 
for any positive sequence Cj with Yl'jLi '^'j ~ ^- These bounds on bj correspond to a hyperrect- 
angle while the M'" in Corollary 5 corresponds to an ellipsoid. The bounds on bj shrink as L 
increases, corresponding to a smaller local identification set when more nonlinearity is allowed. 
Also, it is well known that, at least in certain environments, imposing bounds on Fourier co- 
efficients corresponds to imposing smoothness conditions, like existence of derivatives; see for 
example Kress (Chapter 8, 1999). In that sense the identification set in Corollary 5 imposes 
smoothness conditions on the deviations a — from the truth. 

The bound imposed in J\f"' of Corollary 5 is a "source condition" under Assumption 3 b) 
and is similar to conditions used by Florens, Johannes and Van Bellegem (2011) and others. 
Under Assumption 3 a) it is similar to norms in generalized Hilbert scales, for example, see 
Engl, Hanke, and Neubauer (1996) and Chen and Reifi (2011). Our Assumption 3 a) or 3 b) 
are imposed on deviations a — oq, while the above references all impose on true function ao 
itself as well as on the parameter space hence on the deviations. 

To illustrate the results of this Section we consider an endogenous quantile example where 
< r < 1 is a scalar, 

p{Y,X,a) = l{Y <a{X))-T, 
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A = {a(-) : E[a(X)2] < oo}, and B = {a(-) : E[a(VF)2] < oo}, with the usual Hilbert spaces of 
mean-square integrable random variables. Here we have 

m{a) = E[l(y < a{X))\W] - r. 

Let /y W) denote the conditional probability density function (pdf) of Y given X and W, 
fx{x\W) the conditional pdf of X given W, and f{x) the marginal pdf of X. 

Theorem 6: // fY{y\X,W) is continuously differentiable in y with \dfY{y\X,W)/dy\ < Li, 
fx{x\W) < L2/(x), and m'h = E[fY{ao{X)\X,W)h{X)\W] satisfies Assumption 3, then oq is 
locally identified on 

oo oo 

M = {a = ao + J2 bj^j ^^-Yl ^V^'] < (^1^2)-'}. 
j=i j=i 

This result gives a precise link between a neighborhood on which is locally identified 
and the bounds Li and L2. Assumption 3 b) will hold under primitive conditions for m' to 
be complete, that are given by Chernozhukov, Imbens, and Newey (2007). Theorem 6 corrects 
Theorem 3.2 of Chernozhukov, Imbens, and Newey (2007) by adding the bound on Xlj^i ^'j/l^'j- 
It also gives primitive conditions for local identification for general X while Chernozhukov and 
Hansen (2005) only gave primitive conditions for identification when X is discrete. Horowitz and 
Lee (2007) impose analogous conditions in their paper on convergence rates of nonparametric 
endogenous quantile estimators but assumed identification. 

4 Semiparametric Models 
4.1 General Theory 

In this section, we consider local identification in possibly nonlinear semiparametric models, 
where a can be decomposed into a p x 1 dimensional parameter vector /3 and nonparametric 
component g, so that a = {/3,g). Let | • | denote the Euclidean norm for j3 and assume g ^ Q 
where ^ is a Banach space with norm such as a Hilbert space. We focus here on a 

conditional moment restriction model 

E[p(y,X,/3o,5o)|W^] = 0, 
[12] 



where p{y,x,l3,g) is a J x 1 vector of residuals. Here m(a) = E[/?(y, X, /3, (7)! W^] will be 
considered as an element of the Hilbert space of J x 1 random vectors with inner product 

{a,b) = E[a{Wfb{W)]. 

The differential m'(a — oq) can be expressed as 

m'{a - ao) = m'p{l3 - /3o) + m'g{g - go), 

where m'^ is the derivative of m{/3,go) = E[p{Y, X , 13 , gQ)\W] with respect to /? at /3o and m'g 
is the Gateaux derivative of m(/3o,5) with respect to g at gQ. To give conditions for local 
identification of /3o in the presence of the nonparametric component g it is helpful to partial 
out g. Let M be the closure of the linear span M of m'g{g — go) for g £ M'g where N'g will be 
specified below. In general M because the linear operator m'g need not have closed range 
(like ml onto, a closed range would also imply a continuous inverse, by the Banach inverse 
theorem). For the fc*^ unit vector e^, (fc = 1, let 

a = arg mmE[{m'p{W)ek - CiW)]'' {m'p{W)ek - Q{W)]], 

which exists and is unique by standard Hilbert space results; e.g. see Luenberger (1969). Define 
H to be the p x p matrix with 

Ujk := E [{m'p{W)ej - C*{W)f {m'f,{W)ek - CkiW)}] , {j,k = 1, ...,p). 

The following condition is important for local identification of /3o- 

Assumption 4: The mapping m' -.W x M'g — > B is continuous and H is nonsingular. 

This assumption is similar to those first used by Chamberlain (1992) to establish the possi- 
bility of estimating parametric components at root-n rate in semi-parametric moment condition 
problems; see also Ai and Chen (2003) and Chen and Pouzo (2009). In the local identification 
analysis considered here it leads to local identification of (3q without identification of g when 
m(/3o, g) is linear in g. It allows us to separate conditions for identification of /3o from conditions 
for identification of g. Note that the parameter /? may be identified even when H is singular, 
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but that case is more complicated, as discussed at the end of Section 2, and we do not analyze 
this case. 

The following condition controls the behavior of the derivative with respect to /3: 

Assumption 5: For every e > there is a neighborhood B of (3q and a set J\fg such that 
for all g E Mg with probability one E[p{Y, X , (3 , g)\W] is continuously differentiable in (3 on B 
and 

sup /e[sup |9E[p(y, X, /?, g)\W]/dp - 5E[p(y, X, /3o, 5o)|VF]/a/3|2] ^ ^_ 

It turns out that Assumptions 4 and 5 will be sufficient for local identification of /3o when 
m{/3Q,g) is linear in g, i.e. for m{/3,g) = to imply j3 = (Sq when {/3,g) is in some neighborhood 
of (/3o,(7o)- This works because Assumption 4 partials out the effect of unknown g on local 
identification of (Sq. 

Theorem 7: If Assumptions 4 md 5 are satisfied and m{j3Q,g) is linear in g then there 
is an e > such that for B and Mg from Assumption 5 and J\fg from Assumption 4, /3o 
locally identified for Af = B x (AfgHAfg). If, in addition, Assumption 1 is satisfied for m'g and 
Mg r\Mg replacing m! and M' then = (/3o)5o) locally identified for M. 

This result is more general than Florens, Johannes, and Van Bellegem (2012) and Santos 
(2011) since it allows for nonlinearities in /3, and dependence on g of the partial derivatives 
d^[p{Y,X,p,g)\W]/dp. When the partial derivatives d^[p{Y,X,p,g)\W]/dp do not depend 
on g, then Assumption 5 could be satisfied with Mg = Q, and Theorem 7 could then imply local 
identification of /3o in some neighborhood of /?o only. 

For semiparametric models that are nonlinear in g we can give local identification results 
based on Theorem 2 or the more specific conditions of Theorem 4 and Corollary 5. For brevity 
we give just a result based on Theorem 2. 

Theorem 8: If Assumptions 4 cLnd 5 are satisfied and m{Po,g) satisfies Assumption 2 
with M" = Mg , then there is an e > such that for B and Mg from Assumption 5, Mg from 
Assumption 4, and 

M'g" = {g : \\m'g{g - go)\\^ > e~^L \\g - go\\'^} 
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it is the case that ao = {f^o, go) is locally identified for M = B x [Mg H Mg H Mg H J^g')- 
4.2 A Single Index IV Example 

Econometric applications often have too many covariates for fully nonparametric estimation 
to be practical, i.e. they suffer from the curse of dimensionality. Econometric practice thus 
motivates interest in models with reduced dimension. An important such model is the single 
index model. Here we consider a single index model with endogeneity, given by 

Y = goiXi + Xj/3o) + U, E[U\W] = 0, (4.4) 

where (3q is a vector of unknown parameters, go{-) is an unknown function, and W are instru- 
mental variables. Here the nonparametric part is just one dimensional rather than having the 
same dimension as X. This model is nonlinear in Euclidean parameters, and so is an example 
where our results apply. Our results add to the literature on dimension reduction with endo- 
geneity, by showing how identification of an index model requires fewer instrumental variables 
than a fully nonparametric IV model. We could generalize the results to multiple indices but 
focus on a single index for simplicity. 

The location and scale of the parametric part are not identified separately from go, and 
hence, we normalize the constant to zero and the coefficient of Xi to 1. Here 

m{a){W) = E[Y - g{X, + X^P)\W]. 

Let V = Xi + X2P0 and for differentiable go{V) let 

m'0 = -E[g',{V)X^\W]. 

Let Cj denote the projection of m'^Cj = —E[gQ{V)X2j\W] on the mean-square closure of the set 
{E[/i(y)|VF] : E[/i(T/)2] < 00} and H the matrix with Ujk = E[(m^ej - q){m'pek - Q)]- 

Theorem 9: Consider the model of equation l^4-4\ )- If o-) 5o(^) is continuously differentiable 
with hounded derivative g^iV) satisfying |(7o(^) ~ 5o(^)l — ^g\^ ~ ^1 some Cg > 0, b) 
E[|X2|*^] < 00, and c) U is nonsingular, then there is a neighborhood B of (3q and 6 > such 
that for 

Mg = {g : g{v) is continuously differentiable and sup \g' {v) — S'o(f)| < 5} 

V 
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/So is locally identified for M = BxMg. Furthermore, if there is M'g such that YjlgiV) — gQ{y)\W\ 
is bounded complete on the set {g{V) — goiV) : g £ Mg} then {f3o,go) is locally identified for 

M = Bx {Mlr\M'g). 

Since this model includes as a special case the linear simultaneous equations model the usual 
rank and order conditions are still necessary for 11 to be nonsingular for all possible models, and 
hence are necessary for identification. Relative to the linear nonparametric IV model in Newey 
and Powell (2003) the index structure lowers the requirements for identification by requiring that 
m'gh = —'Ej[h[V)\W] be complete on AA^ rather than completeness of the conditional expectation 
of functions of X given W . For example, it may be possible to identify /3o and g^ with only 
two instrumental variables, one of which is used to identify g^ and functions of the other being 
used to identify /3o- 

To further explain we can give more primitive conditions for nonsingularity of 11. The 
following result gives a necessary condition for 11 to be nonzero (and hence nonsingular) as well 
as a sufficient condition for nonsingularity of 11. 

Theorem 10: Consider the model of ^.4\ )- If n is nonsingular then the conditional distri- 
bution of W given V is not complete. Also, if there is a measurable function T{W) such that 
the conditional distribution of V given W depends only on T(W) and for every p x I vector 
A 7^ 0, E[5rg(y)A"^X2|M^] is not measurable with respect to T{W), then 11 is nonsingular. 

To explain the conditions of this result note that if there is only one variable in W then 
the completeness condition (of W given V) can hold and hence IT can be singular. If there 
is more than one variable in W then generally completeness (of W given V) will not hold, 
because completeness would be like identifying a function of more than one variable (i.e. W) 
with one instrument (i.e. V). IfW and V are joint Gaussian and V and W are correlated then 
completeness holds (and hence 11 is singular) when W is one dimensional but not otherwise. In 
this sense having more than one instrument in is a necessary condition for nonsingularity 
of n. Intuitively, one instrument is needed for identification of the one dimensional function 
go(y) so that more than one instrument is needed for identification of (3. 
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The sufficient condition for nonsingularity of IT is stronger than noncompleteness. It is 
essentiahy an exclusion restriction, where E[(7g(y)X2| Vl^] depends on Vl^ in a different way than 
the conditional distribution of V depends on W. This condition can be shown to hold if W and 
V are Gaussian, W is two dimensional, and E[(7Q(l^)X2|Ty] depends on all of W. 

5 Semiparametric CCAPM 

Consumption capital asset pricing models (CCAPM) provide interesting examples of nonpara- 
metric and semiparametric moment restrictions; see Gallant and Tauchen (1989), Newey and 
Powell (1988), Hansen, Heaton, Lee, and Roussanov (2007), Chen and Ludvigson (2009), and 
others. In this section, we apply our general theorems to develop new results on identification 
of a particular semiparametric specification of marginal utility of consumption. Our results 
could easily be extended to other specifications, and so are of independent interest. 

To describe the model let Ct denote consumption level at time t and q = Ct/Ct-i be 
consumption growth. Suppose that the marginal utility of consumption at time t is given by 

MUt = C^^'go{Ct/Ct-i) = Cr^°5o(Q), 

where go{c) is an unknown positive function. For this model the intertemporal marginal rate 
of substitution is 

SoMUt+i/MUt = Soc-^^go{ct+i)/go{ct), 

where < (^o < 1 is the rate of time preference. Let Rt+i = {Rt+i,i, Rt+ij)"^ be a J x 1 
vector of gross asset returns. A semiparametric CCAPM equation is then given by 

E[Rt+i5oc;;!ngo{ct+i)/go{ct)}\Wt] = e, (5.5) 

where Wt = (Zt, ct) is a vector of variables observed by the agent at time t, and e is a J x 1 vector 
of ones. This corresponds to an external habit formation model with only one lag as considered 
in Chen and Ludvigson (2009). As emphasized in Cochrane (2005), habit formation models 
can help explain the high risk premia embedded in asset prices. We focus here on consumption 
growth Ct = Ct/Ct~i to circumvent the potential nonstationarity of the level of consumption, 
see Hall (1978), as has long been done in this literature, e.g. Hansen and Singleton (1982). 
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From economic theory it is known that under complete markets there is a unique intertem- 
poral marginal rate of substitution that solves equation (jS.Sp . when Rt is allowed to vary over 
all possible vectors of asset returns. Of course that does not guarantee a unique solution for a 
fixed vector of returns Rt- Note though that the semiparametric model does impose restrictions 
on the marginal rate of substitution that should be helpful for identification. We show how 
these restrictions lead to local identification of this model via the results of Section 4. 

This model can be formulated as a semiparametric conditional moment restriction by letting 
Y = {Rf^„ct+i,ctf, /3 = (J, 7)^, W = {Z^,ctf, and 

P{y,^,g) = Rt+iSc^^i9{ct+i) - g{ct)e. (5.6) 

Then multiplying equation (|5.5p through by goict) gives the conditional moment restriction 
E[/>(y, /3o, 5o)| W^] = 0. Let At = i?f+i(5oc^^. The nonparametric rank condition (Assumption 
1 for g) will be uniqueness, up to scale, of the solution g^ of 

E[Atg{ct+i)\Wt] = g{ct)e. (5.7) 

This equation differs from the linear nonparametric IV restriction where the function goi^) 
would solve E[y|Ty] = E[^(X)|M^]. That equation is an integral equation of the first kind while 
equation (15. 7p is a homogeneous integral equation of the second kind. The rank condition for 
this second kind equation is that the null space of the operator E[j4jg((Q4.i)|Tyt] — g{ct)e is one- 
dimensional, which is different than the completeness condition for first kind equations. This 
example illustrates that the rank condition of Assumption 1 need not be equivalent to com- 
pleteness of a conditional expectation. Escanciano and Hoderlein (2010) and Lewbel, Linton, 
and Srisuma (2012) have previously shown how homogenous integral equations of the second 
kind arise in CCAPM models, though their models and identification results are different than 
those given here, as further discussed below. 

Let Xt = (l/(5o, — ln(Q+i))"^. Then differentiating inside the integral, as allowed under 
regularity conditions given below, and applying the Gateaux derivative calculation gives 

m'^iW) = E[Atgo{ct+i)X^\Wt], m'^g = E[Atg{ct+i)\Wt] - g{ct)e. 
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When E[^((7(cf+i)| Wj] is a compact operator, as holds under conditions described below, it 
follows from the theory of integral equations of the second kind (e.g. Kress, 1999, Theorem 3.2) 
that the set of nonparametric directions M will be closed, i.e. 

M=M = {E[Atg{ct+i)\Wt] - g{ct)e : \\g\\g < oo}, 

where we will specify \\g\\g below. Let 11 be the two-dimensional second moment matrix 11 of 
the residuals from the projection of each column of m|Jj on A^, as described in Section 4. Then 
nonsingularity of 11 leads to local identification of /3o via Theorem 7. 
To give a precise result let A be any positive number, 

A = (l + |i2ml)[2 + |ln(Q+i)|2] sup c,7i, 

7e[7o— A,7o+A] 

Il5llg = VnE[Dt\Wt]^9ict+in 
The following assumption imposes some regularity conditions. 

Assumption 6: {Rt,ct,Wt) is strictly stationary^ E[Dj] < oo, and \\go\\g < oo. 

The following result applies Theorem 7 to this CCAPM. 

Theorem 11: Suppose that Assumption 6 is satisfied. Then the mapping m' -.MP xQ — > B 
is continuous and if 11 is nonsingular there is a neighborhood B of (3q and e > such that for 
^9 ~ {9 ■ Wd ~ 9o\\g < £}) /3o is locally identified for J\f = B xMg . If in addition m'g{g — g^) 7^ 
for all g go and g G Afg then {(3oi9o) is locally identified for Af = B x J\fg . 

Primitive conditions for nonsingularity of IT and for m'g{g — go) 7^ when g ^ go are needed 
to make this result interesting. It turns out that some completeness conditions suffice, as shown 
by the following result. Let f^yy{c,'w) denote the joint pdf of Wt), and /c(c) and fyy{w) 

the marginal pdfs of Q+i and Wt respectively. 

Theorem 12: Suppose Assumption 6 is satisfied, Vi{go{ct) = 0) = 0, for some w{Zt) and 
Wt = {w{Zt),Ct), {ct+i,Wt) is continuously distributed and there is some j and corresponding 
Atj = 5oRt+l,jC^_^l satisfying 

E[4./,(Q+i)-i/iy(^*)"'4iy(ct+i,^t)] < 00. (5.8) 
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Then (a) if 'E[Atjh{ct+i, Ct)\Wt] = implies h{ct+i,Ct) = 0, then H is nonsingular; (h) if for 
Sonne c with goic) ^ 0, E[Atjh{ct+i)\'w{Zt),ct = c] = implies h{ct+i) = 0, then go is the 
unique solution to 'E[Atg{ci^i)\Wt] = g{ct) up to scale. 

Equation (|5.8p implies E[^jj(/(Q+i)| W^] is a Hilbert-Schmidt integral operator and hence 
compact. Analogous conditions could be imposed to ensure that is closed. The suffi- 
cient conditions for nonsingularity of 11 involve completeness of the conditional expectation 
'E\Atjh{ct+i-,ct)\Wt\. As previously noted, sufficient conditions for completeness can be found 
in Newey and Powell (2003) and Andrews (2011) and completeness is generic in the sense of 
Andrews (2011) and Lemma 3. 

Condition b) is weaker than condition a). Condition (b) turns out to imply global identi- 
fication of /3o and g^ (up to scale) if go{c) is bounded, and bounded away from zero. Because 
we focus on applying the results of Section 4, we reserve this result to Theorem A. 3 in the 
Appendix. Even with global identification the result of Theorem 12 (a) is of interest, because 
nonsingularity of IT will be necessary for 70 to be estimable at a root-n rate. The identifi- 
cation result for 70 in Theorem A. 3 involves large and small values of consumption growth, 
and so amounts to identification at infinity, that generally does not lead to root-n consistent 
estimation, e.g. see Chamberlain (1986). 

A different approach to the nonparametric rank condition, that does not require any in- 
strument w{Zt) in addition to q, can be based on positivity of go{c). The linear operator 
E[^jjg'(Q+i)|Q] and g{c) will be infinite dimensional (functional) analogs of a positive ma- 
trix and a positive eigenvector respectively, by equation (j5.7p . The Perron- Frobenius Theorem 
says that there is a unique positive eigenvalue and eigenvector (up to scale) pair for a pos- 
itive matrix. A functional analog, based on Krein and Rutman (1950), gives uniqueness of 
(70(c), as well as of the discount factor 60. To describe this result let r(c, s) = E[Rt+ij\ct+i = 
s,Ct = c], f{s,c) be the joint pdf of (q+i,q), /(c) the marginal pdf of q at c and K{c,s) = 
r{c,s)s~'^°f{s,c)/[f{s)f{c)]. Then the equation E[Atjg{ct+i)\ct] = g{ct) can be written as 



for 5 = 5q. Here the matrix analogy is clear, with /^(c, s)f{s) being like a positive matrix, g{c) 




(5.9) 
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an eigenvector, and 6 an eigenvalue. 

Theorem 13: Suppose that {Rtj,ct) is strictly stationary, f{c,s) > andr(c,s) > almost 
everywhere, and f f K{c, s)"^ f{c)f{s)dcds < oo. Then equation i5.y\) has a unique nonnegative 
solution {6o,go), with 6o > 0, go > almost everywhere, and E[go{ct)'^] = 1. 

The conditions of this result include r(c, s) > 0, that will hold if Rt+ij is a positive risk free 
rate. Under square-integrability of K, we obtain global identification of the pair {6o,go). The 
uniqueness of go{c) in the conclusion of this result implies the nonparametric rank condition. 
Note that by iterated expectation and inclusion of Rt+i,j in Rt+i any solution to equation ()5.7p 
must also satisfy equation (j5.9p . Thus Theorem 13 implies that gQ is the unique solution to 
()5.7p . Theorem 13 actually gives more, identification of the discount factor given identification 
of 70. 

Previously Escanciano and Hoderlein (2010) and Lewbel, Linton and Srisuma (2012) had 
considered nonparametric identification of marginal utility in consumption level, MU{Ct), by 
solving the homogeneous integral equation of the second kind: 

E[Rt+ij5oMU{Ct+i)\Ct] = MU{Ct). 

In particular, Escanciano and Hoderlein (2010) previously gave an insightful identification result 
based on positive marginal utility and Perron-Frobenis theorem, but assuming that Ct has a 
compact support and Rt+ij is a risk-free rate. Lewbel, Linton and Srisuma (2012) previously 
use a genericity argument for identification of MU{Ct). In comparison, we also use positivity, 
but identify g(c) rather than the marginal utility, and base the result on Krein-Rutman's version 
of the Perron- Frobenius theorem, which allows us avoid making boundedness restrictions on the 
support of Ct and even on q. Finally, Perron- Frobenius theory has been extensively used by 
Hansen and Scheinkman (2009, 2012) and Hansen (2012) in their research on the long-run risk 
and dynamic valuations in a general Markov environment in which their valuation operators 
may not be compact. Our results follow the same general path, though we apply the Krein- 
Rutman theorem (and its extensions) to a different operator than theirs. 

The models considered here will generally be highly overidentified. We have analyzed iden- 
tification using only a single asset return Rt+ij. The presence of more asset returns in equation 

[21] 



(jS.Sp provides overidentifying restrictions. Also, in Theorem 12 we only use a function w{Zt) of 
the available instrumental variables Zt in addition to q. The additional information in Zt may 
provide overidentifying restrictions. These sources of overidentification are familiar in CCAPM 
models. See, e.g., Hansen and Singleton (1982) and Chen and Ludvigson (2009). 

6 Conclusion 

We provide sufficient conditions for local identification for a general class of semiparametric and 
nonparametric conditional moment restriction models. We give new identification conditions 
for several important models that illustrate the usefulness of our general results. In particular, 
we provide primitive conditions for local identification in nonseparable quantile IV models, 
single-index IV models, and semiparametric consumption-based asset pricing models. 

Appendix 

7 Proofs for Section 2 

7.1 Proof of Parametric Result 

By rank{m') = p, the nonnegative square root 7] of the smallest eigenvalue r/^ of {m')'^m' is 
positive and \nn'h\ > r]\h\ for h G W. Also, by the definition of the derivative there is e > 
such that \m{a) — m(ao) ~ m'[a — ao)| / W ~ Q^ol < 'H W ~ "^ol < £ with a ^ oq. Then 

|m(a) — 7n'(a — ao)| \m{a) — 'm{aQ) — m' {a — aQ)\ |a — ao| ^rj ^ (7 10) 

|m'(a — ao)| |a — ckoI |m'(a — ao)| r/ 

This inequality implies m{a) / 0, so ao is locally identified on {q : |a — ao| < e].Q.E.D. 

7.2 Proof of Theorem 1 

If m'h = m'h for h ^ h then for any A > we have m'h = for h = X{h — h) ^ 0. For A small 
enough h would be in any open ball around zero. Therefore, Assumption 1 holding on an open 
ball containing oq implies that m' is invertible. By m' onto and the Banach Inverse Theorem 
(Luenberger, 1969, p. 149) it follows that {m')~^ is continuous. Since any continuous linear 
map is bounded, it follows that there exists r] > such that ||m'(a — ao)||i3 > ?? ||ck — c^oll^ for 
all a £ A. 
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Next, by Frechet differentiability at qq there exists an open ball Ne centered at ao such 
that for all a G Me, ol / ctO) 



|m(a) — m(Q;o) — m'(a — ao)!!^ 



< T]. (7.11) 



11" - aolU 
Therefore, at all such a 7^ aoi 

||m(a) — m'(a — ao)|lB ||m(a) — m(Qo) — m'(a — ao)||g ||a — aolly^ 



||m'(a - ao)||B ||a-ao||^ ||m'(a - ao)!!^ 
< ?7/r/ = 1. 

Therefore, as in the proof of the parametric result above, m(a) 7^ 0. Q.E.D. 
7.3 Proof of Theorem 2 

Consider a SAA with a ^ aQ. Then 

||m(a) — m'(a — Qo)||g ||m(Q) — m(Qo) — — oq)!!/? ^ ~ cioll^ 



||m'(Q; - ao)|lB - ao)|lB ||m'(Q! - ao)|le 

The conclusion follows as in the proof of Theorem 1. Q.E.D. 

8 Proofs for Section 3 
8.1 An Example for Lemma 3 

It is useful to give explicit examples of the randomization algorithms obeying conditions 1 and 
2 listed in Section 2. Suppose A = B = L^[0,1], and that m' is an integral operator 

m'5 = [ K{-,t)5{t)dt. 



The kernel K of this operator is generated as follows. The nature performs step 1 by selecting 
two, possibly different, orthonormal bases {4>j^ and {(^j} in L^[0, 1]. The nature performs step 
2 by first selecting a bounded sequence < Uj < cr for j = 0, 1, sampling Uj as i.i.d. 1, 1], 

and then setting Aj = UjCTj. Finally, for some scalar k > it sets 

00 

3=0 

The operator defined in this way is well-defined over A and is bounded, but it need not be 
compact. If compactness is required, we impose X^^^i o"? < 00 in the construction, li K > Q 
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is required, we can impose cpo = 1,930 = li ^ c and < c, for all j, where c > 1 is 
a constant, and Yl'jLo^j < define instead Aq as c^°^]^Aj + |no|o"o. If in addition to 

positivity, J K{z,t)dt = 1 is required, for example if K{z,t) = f{t\z) is a conditional density, 
then we select k > so that kXq = 1. This algorithm for generating m' trivially obeys conditions 
1 and 2 stated above. Furthermore, uj need not be i.i.d. Take the extreme, opposite example, 
and set uj = U[—l, 1] for all j, that is Uj's are perfectly dependent. The resulting algorithm 
for generating m' still trivially obeys conditions 1 and 2. 

8.2 Proof of Lemma 3 

By assumptions there exists a compact, injective operator K, : A ^ B. By Theorem 15.16 in 
Kress (1999) /C admits a singular value decomposition: 



where {4>j} is an orthonormal subset of A, either finite or countably infinite, with cardinality 
N < 00, {^Pj} is an orthonormal subset of B of equal cardinality, and {fJ'j)^^ is bounded. Since 
ll^"^!!! = X^^o ('i^i' '^)^' injectivity of /C requires that must be an orthonormal basis in 
A and 7^ for all j. Therefore, step 1 is always feasible by using these {4>j} and {fj} in 
the construction. The order of eigenvectors in these sets need not be preserved and could be 
arbitrary. Step 2 is also feasible by using a product of Lebesgue- dominated measures on a 
bounded subset of R to define a measure over M^, or, more generally, using any construction of 
measure on from finite-dimensional measures obeying Kolmogorov's consistency conditions 
(e.g. Dudley, 1989) and the additional condition that r]{Xj^^ € ^ ^i-'-^^jk ^ ^} = if 

Leb(j4) = 0, for any finite subset {ji, jfc} C {0, N}. This verifies claim 1. 
To verify claim 2, by Bessel's inequality we have that 



N 




j=0 



N 



m'6\\s 



> 



j=0 



m' is not injective iff A^k^ = for some j 
r]{{Xj = 0}) = holding by assumption. 



By countable additivity and by Leb({0}) = 



TV 



Pr^(3je{0,l,...,A^} 



A,=0)<^?7({A, =0})=0. 



j=0 
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The final claim follows from the penultimate display. Q.E.D. 



8.3 Proof of Theorem 4 

By Assumption 3, for any a ^ and a G N'" with Fourier coefficients hj we have 

( ( 

— ao)||g > ^ Aij&j I ^ ^ \ = L ||a — aoll^ j 

so the conclusion follows by Theorem 2. Q.E.D. 

8.4 Proof of Corollary 5 

Consider a G AA'". Then 

Y,^,f^'~%]<L-'/^^-') (8.12) 
i 

For 6j = {a — aQ^4)j) note that ||a — aoll^i = (Z^j^j)"'^^^ by (j)2, ... being an orthonormal 
basis. Then 

/ X 1/2 / X (r-l)/2r / \ l/2r 

/ \ l/2r- 

where the first inequality holds by the Holder inequality, the second by eq. (j8.12p , and the third 
by Assumption 3. Raising both sides to the r^^ power and multiplying through by L gives 

L ||a — aoll^ < — "o)||g • (8.13) 

The conclusion then follows by Theorem 4. Q.E.D. 

8.5 Proof of Theorem 6 

Let F{y\X, W) = Pr(y < y\X, W), m{a) = E [1{Y < a{X))\W] - r, and 

m'h = E [/y(ao(A)| A, W)h{X)\W] , 
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so that by iterated expectations, 

m{a) = E [F{a{X)\X, W)\W] - r. 

Then by a mean value expansion, and by /y W) continuously differentiable 

\F{a{X)\X, W) - F{ao{X)\X, W) - /y(ao Wl^, W){a{X) - qo(X))| 
= {a{X)\X,W) - fY{ao{X)\X,W)] [a{X) - ao(,X)]\ < Li [a{X) - ao{X)f , 

where a{X) is the mean value that lies between a{X) and aQ{X). Then for L1L2 = L, 

\m{a){W)-m{ao){W)-m'{a-ao){W)\ < LiE [{a{X) - ao{X)}'^\W] 

< LE[{a(X)-ao(^)}^] = ^||a-ao||^- 

Therefore, 

||?n(a) — m(ao) — m'{a — ao)||g ^ L\\a — ao||^ , 

so that Assumption 2 is satisfied with r = 2 and J\f" = A. The conclusion then follows from 
Corollary 5. Q.E.D. 

9 Proofs for Section 4 

9.1 Useful Results on Projections on Linear Subspaces 

Before proving the next Theorem we give two useful intermediate result. Let Proj(&|A1) denote 
the orthogonal projection of an element 6 of a Hilbert space on a closed linear subset Ai of that 
space. 

Lemma Al: If a) A4 is a closed linear subspace of a Hilbert space H; b) bj & H {j = 
1, . . . ,p); c) the pxp matrix H with Hjk = (bj — Proj(6j | Al), 6^ — Proj(6fc|7W)) is nonsingular, 
then for b = (61, ... , 6^)"^ there exists e > such that for all a £MP and C G -M, 

\\b^a + C\\>e{\a\ + \\C\\). 
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Proof: Let bj = Proj(6j|A^), bj = bj — bj, b = (61, and b = (61, ...,bp)'^. Note that 



for £1 = v'Amin(n)/2, 
\\b^a + C\\ 



+ C + b^c 



b^c 



> 



b^a 



+ \\( + b^a\\)/V2 = {VaTna + \\C + b^ 



> ei |a| + IIC + 6^a|| /V2 



Also note that for any C* > \\bj\\ it follows by the triangle and Cauchy-Schwartz in- 

equalities that 



a\ < 



Choose C* big enough that ei/^/2C* < 1. Then by the triangle inequality, 



||C + 6^a||/^ > [ei/V2C*j\\C + b^a\\/V2 = ei\\C + b^a\\/2C* 
> £i(||C||-p^a||)/2C*>ei(||C||-C*|a|)/2C* 
= (ei/2C7*)||C||-ei|a|/2. 

Then combining the inequalities, for e = min{ei/2, ei/2C*}, 

||6'^a + C|| > ei|a| + (ei/2C*)||C||-ei|a|/2 

= (£i/2) \a\ + (£i/2C*) IICII > e (|a| + ||C||) .Q.E.D. 



Lemma A2: If Assumption 4 is satisfied then there is an e > such that for all {P,g) € 



- /3o| + \\m'Jg - 5'o)|L) < \\m'{a - qq)! 



Proof: Apply Lemma Al with T-L there equal to Hilbert space B described in Section 4, M 
in Lemma Al equal to the closed linear span of A4={m'g{g — go) : g G A/"^}, bj = m'^ej for the 
unit vector Cj, and a = /3 — /3o- Then for all {13, g) x M'g we have 

m'{a - Qo) = b^a + C, h^a = m'Jp - /3o), C = "m'ig - go) € M. 



The conclusion then follows by the conclusion of Lemma Al. Q.E.D. 
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9.2 Proof of Theorem 7 

Since Assumption 4 is satisfied the conclusion of Lemma A2 holds. Let e be from the conclusion 
of Lemma A2. Also let Mg = M'g H Mg for M'g from Assumption 4 and Mg from Assumption 5. 
In addition let B be from Assumption 5 with 

sup E[sup|aE[p(y,X,/3,5)|T^]/a/3-9E[p(y,X,/3o,5o)|VF]/9/3|'] 
9e< /'SB 

Then by m{(3Q,g) linear in g and expanding each element of m{[5,g){W) = ^[piY^X, l3,g)\W] 
in /3, it follows that for each {fi^g) £ B x Mg, if /3 / /3o) 

\\m{a) - m'{a - ao)!!^ = \\m{l3,g) - m{/3o,g) - m'p{l3 - /3o)||g 



dmCp,g)/d!3-m'p (/3 - /3o 



< 



m'p{j3,g) -m'p 1/3 - /3o 



< e\(3-(3o\ <e(|/3-/3o| + ||K(5-5o)|L) < ||m'(a-ao)| 



where /? is a mean value depending on W that actually differs from row to row of 



m'pC^^g) = dnp{y,X,~p,g)\W]/d(5. 

Thus, ||w,(a) — m'(a — ao)!!;? < ||?7i'(a — ao)||g, implying m{a) ^ 0, giving the first conclusion. 

To show the second conclusion, consider (/3, g) G J\f. If /3 7^ /3o then it follows as above that 
m[a) 7^ 0. If /3 = /3o and g ^ go then by linearity in g we have ||?ra(a) — m'{a — ao)||g = 
while ||m'(a — ao)!!;? = H'^glff ~ fi'o)!!^ > 0' s° m{a) 7^ follows as in the proof of Theorem 1. 
Q.E.D. 



9.3 Proof of Theorem 8 

Since Assumption 4 is satisfied the conclusion of Lemma A2 holds. Let e be from the conclusion 
of Lemma A2. Define B as in the proof of Theorem 7. By Assumption 2, for g G Mg , 
||m(/3o,5') — m'g{g — 5o)||g ^ -^115" 9o\\[a- Then similarly to the proof of Theorem 7 for all 
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a £ M with a / oq, 

||m(a) — m'{a — cto)]]^ 

< WMl^^a) -m{(3Q,g) - m^(/3 - /3o)||g + \\m{(3Q,g) - m'g{g - gQ)\\^ 

< e|/3-/3o| +^115-5011^ < e|/3-/3o| + £ \\m' {g - go)\\ 

< ||m'(Q - ao)||g . 

The conclusion fohows as in the conclusion of Theorem 1. Q.E.D. 
9.4 Proof of Theorem 9 

The proof will proceed by verifying the conditions of Theorem 7. Note that Assumption 4 is 
satisfied. We now check Assumption 5. Note that for any 5 > and g G A/"^, g{Xi + (3) is 
continuously differentiable in /3 with dg{Xi+ Xj f3)/dl3 = g'{Xi + X^ f3)X2. Also, for A ap x 1 
vector and B a neighborhood of zero it follows by boundedness of gQ and the specification of 
Af^ that for some C > 0, 

E[sup_ \g'{Xi + X^ifi + A))A:2| \W] < CE[\X2\ \W] < oo a.s. 

Therefore, by the dominated convergence theorem m{a)(W) = E[y — g{Xi + Xj/S)!^^] is 
continuously differentiable in /3 a.s. with 

dm{a){W)/d(3 = -E[g'{Xi + Xjl3)X2\W]. 

Next consider any e > and let B and 5 satisfy 

5 = {/3 : 1/3 - /3oP < eV4C2E[|X2|']} and 6^ < ey4E[\X2\% 

Then for g e J\f^ we have, for v{X, /3) = Xi + X^/3, 

E[sup \dm{a){W)/d/3 - mUW)]"^] 

/3GB 

= E[snp\E[{g'{v{X,(3))-g'o{V)}X2\wf]<E[\X2\^snp\g\v{X,f3))-g'o{V)\^ 

< 2E[\X2\' sup \g'{v{X, (3)) - g'o{v{X, + 2E[\X2\^ sup |5d(^^(X, /?)) - g'o{V)\^] 

paB fSeB 

< 26^E[\X2\^] + 2C^gE[\X2\^] sup |/3 - (3o\^ < e\ 

I3<^B 
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Thus Assumption 5 is satisfied so the first conclusion follows by the first conclusion of Theorem 
7. Also, m'g{g — go) = E[g{V) — go{V)\W] the rank condition for m'g follows by the the last 
bounded completeness on J\fg, so that the final conclusion follows by the final conclusion of 
Theorem 7. Q.E.D. 

9.5 Proof of Theorem 10 

Suppose first that the conditional distribution of W given V is complete. Note that by the 
projection definition, for all h{V) with finite mean-square we have 

= E[{-E[g',{V)X,,\W] - C*{W)}E[h{V)\W]] = E[{-E[g',{V)X,,\W] - C*{W)}h{V)]. 

Therefore, 

E[-E[g'oiV)X2,\W]-C;{W)\V]=0. 

Completeness of the conditional distribution of W given V then implies that — E[(7Q(y)X2j|VF] — 
CjiW) = 0, and hence Hjj = 0. Since this is true for each j we have 11 = 0, 11 is singular. 

Next, consider the second hypothesis and A 7^ 0. Let Ca(^) denote the projection of 
-E[g'o{V)X^X2\W] on M. Since E[h{V)\W] = E[h{V)\T{W)] it follows that Ca(^) is mea- 
surable with respect to (i.e. is a function of) T(W). Since E[(7Q(y)A^X2|lF] is not measurable 
with respect to T{W), we have -E[g'Q{V)X'^ X2\W]- CxiW) / 0, so that 

A^nA = E[{-E[g'oiV)X^X2\W] - Cx{W)}^] > 0. 

Since this is true for all A 7^ 0, it follows that H is positive definite, and hence nonsingular. 
Q.E.D. 

10 Proofs for Section 5 
10.1 Proof of Theorem 11 

The proof will proceed by verifying the conditions of Theorem 7 for p{Y,f3,g) from eq. (|5.7|) . 
Note that £[1)4^ and E[A|W^t] exist with probability one by EfDf] < 00 and that < 
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CD^. Then by the Cauchy-Schwartz inequahty, for any g £ Q we have by Dt > 1, and hence £[1^111^4] > 
1. 

\\E[Atg{ct+i)\Wt] - g{ct)e\\l < CE[E[AJ g{ct+i)\Wt]E[Atg{ct+i)\Wt] + g{ctf] 
< CE[E[D^\Wt]E[g{ct+if\Wt] + CE[E[Dl^\Wt-i]g{ctf] < C \\g\\l . 

Next, let Ht{(3,g) = 6Rt+ic^^^g{ct+i) and B = [6o - A,6o + A] x [70 - A,7o + A]. Note that 
Ht{l3,g) is twice continuously differentiable in /3 and by construction of Dt that 



sup 

/3gS 



dHtil3,g) 



9/3 



< Dtg{ct+i), sup 

I3€B 



d^Htil3,g) 



< A5(Q+i),(j = l,2). 



Therefore by standard results E[p(y, /?, = E[Ht{/3 , g)\Wt] — g{ct) is twice continuously 

differentiable in /3 on B, dE[p{Y, l3,g)\W]/dl3 = E[dHt{fi,g)/dp\Wt\, and 



Also, noting that 



m'p{W) = E[dHt{P^,gQ)/dp\Wt] = E[Atgo{ct+i)Xl\Wt]. 
m'p{W) < E[Dtgo{ct+i)\Wt], the Cauchy-Schwartz inequality gives 



E[|m'^(P^)|'] < E[E[D^\Wt]E[goict+i)^\Wt]] < \\go\\l < 00. 

Therefore it follows that — >• B is continuous, i.e. the first condition of Assumption 
4 is satisfied with J\fg = Q. 

Turning now to Assumption 5, we have 

\E[dHt+i{p,g)/dp - dHt+i{P,g^)/dmtf < E[D^\Wt]EMct+i) - go{ct+i)\^\Wt], 
\E[dHt+i{f3,go)/dfi - dHt+i{f]o,9o)/dmtf < E[D^\Wt]E[go{ct+if\Wt] |/3 - f^of ■ 



Therefore we have 



dE[piY,f3,g)\W] dE[p{Y,Po,go)\W] 



dp 



dp 



= \E[dHt+^{P,g)/dp - dHt+iiPo,9o)/dP\Wtf 

< 2E[D^\Wt]{E[\g{ct+i) - go{ct+i)\^\Wt] + E[go{ct+if\Wt] |/3 - Pof}. 

Note that by iterated expectations, 

E[E[D^t\Wt]E[\g{ct+i) - go{ct+i)\^\Wt]] = \\g - go\\l , 
E[E[D^\Wt]E[goict+i)\^\Wt]] = \\go\\l ■ 
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Consider any e > 0. Let 

= {g : \\g - g^g < e/2} and B = B n {P : \(3 - /3o| < e/{2 \\go\\g). 
Then for g G Afg we have 

E[sup \dm{a){W)/df] - m'f,{W)f] <2\\g- gofg + 2 ||go||g sup |/3 - /3o|' < e^. 

Therefore Assumption 5 holds with B there equal to B here. The conclusion then follows by 
Theorem 7. Q.E.D. 

10.2 Proof of Theorem 12 

Let a{ct+uWt) = nAtj\ct+i,Wt] and d{ct+i) = E[E[L»2|W^t]|Q+i]. Let B ={h{Wt) : ^KWif] < 
oo operator L : Q — >B given by 

Lg = E[^i,5(Q+i)|W^i]= /a(c,W^i)5(c)%^^dc 

g{c)K{c, Wt)Mc)d{c)dc, K{c, Wt) = \ 

Note that d{c) > 1 by > 1. Therefore, 

2- f d{c,wf' f^y^{c,w) 

K{c,w) d{c)fc{c)f^{w)dcdw = / „ , .i, . fcwi^^'^)^^^'^ 

J fwi'^)fc{c)d{c) ^'^ 



E[E[^t,|ct+i,T^f] 



2 /c,iy(ct+l>^t) 



Uct+i)f^{Wty 

< E[4./,(Q+i)-VH/W)-Ve,vi/(cm,W^t)] <00. 

It therefore follows by standard results that L is Hilbert-Schmidt and thus compact. Further- 
more, it follows exactly as in the proof of Theorem 3.2 of Kress (1999), that 

M ={E[Atjg{ct+i)\Wt] - g{ct) : g e G] 

is closed. 
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Next let b = 62)"^ be a constant vector and A(c) = bi/So — 62 ln(c). Suppose b^Ub = 0. 
Then by the definition of 11 there is gk & G sucli tliat 

E[Atgk{ct+i)\Wt] - gk{ct)e E[Atgo{ct+i)A{ct+i)\Wt] 
in mean square as k — > 00. It follows that 

E[Atjgk{ct+i)\Wt] - gu{ct) E[At,go{ct+i)A{ct+i)\Wt] 
in mean square. By M a closed set there exists g*{c) such that 

E[At,go{'^t+i)Hct+i)\Wt] = E[Atjg*{ct+i)\Wt] - g*{ct). 

If g*{ct+i) = then E[Atjgo{ct+i)A{ct+i)\Wt] = and by completeness of E[Atjh{ct+i,ct)\Wt] 
it follows that (7o(q+i)A(q+i) = 0. Then by Pr((j(o(ct+i) 7^ 0) = 1, we have A(ct+i) = 0. 

Next, suppose Vi:{g*{ct) 7^ 0) > 0. Then Y'i{m.\n{\g* {ct)\, gQ{ct)} > 0) > 0, so for small 
enough e > 0, Pr(min{|5f*(ct)|, 50 (ct)} > e) > 0. Let If = l{va\n{\g* {ct)\, go{ct)} > £,). Then 
multiplying through 

nAt,i, m = 1, 

By eq. (jS.Sp we also have 

By completeness it then follows that 

^g go(Q+i)A(Cf+i) - g*(Cf+i) ^ ^g ffo(Q+i) 

* -5*(ct) * ffo(ct) ■ 

Holding cj fixed at any c it follows that (7o(q+i)A(q+i) — = C(7o(q+i) for some nonzero 

constant C. Then taking conditional expectations of both sides it follows that g*{ct) = Cgo{ct). 
Then taking conditional expectations again, 

E[Atjgo{ct+i)A{ct+i)\Wt] = E[Atj g* {ct+i)\Wt] - g*{ct) = C {E[Atj go{ct+i)\Wt] - go{ct)} = 0. 

By completeness it follows that 5fo(ct+i)A(Q+i) = so A(q+i) = follows by Pr(5o(ct) = 0) = 
0. Therefore, we find that 6-^116 = implies A(cj+i) = 0. But we know that for 6 7^ it is the 
case that A(cf+i) 7^ 0. Therefore, 6 7^ implies 6^116 > 0, i.e. 11 is nonsingular. 
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Next, under the last condition of Theorem 12, if E[^(j(7(ct+i)| Wf] = g{ct), it follows that 
for c as given there, 

5o(q+i)| 



E[Atj^^^\wiZt),ct = c] = l = E[At,- 



\w{Zt),ct = c]. 



9{c) 50 (c) 

Then by completeness of E[Atjg{ct+i)\Wt] it follows that g{ct+i)/g{c) = go{ct+i)/goic), i.e. 

9{ct+i) = go{ct+i)g{c)/go{c), 



so g is equal to go up to scale. Q.E.D. 

10.3 Completeness and Global Identification in the CCAPM 

Theorem A. 3. // {Rt,ct) is strictly stationary, ct is continuously distributed with support 
[0, oo), go{c) > is bounded and bounded away from zero, E[R^_^i ^'^"J < oo, and there is c 
such that E[Rt+ijh{ct+i)\w{Zt),c] = and E[R^_^_ijh{ct+i)'^] < oo implies = then 

('^OjTOjS'o) is identified (go up to scale) among all {5,^,g) with g{c) > 0, g{c) bounded and 
bounded away from zero, and E[ii^_|_]^ ^c^^'^] < oo. 

Proof: Consider any two solutions (/3o,5o) and {j3i,gi) to equation ()5.5p satisfying the 
conditions of Theorem A. 3. Then by iterated expectations, 



E 



9o[c) 



1 = E 



w{Zt),c 



By completeness with h{ct+i) = 5oc^^lgo{ct+i) / go{c) - 5ic^^lgi{ct+i)/gi{c) it follows by mul- 
tiplying and dividing that 



„7i -70 



9i{ct+i) 



So9i{c) 



5o(q+i) 

Since the object on the right is bounded and bounded away from zero and the support of ct-\-i 
is I = [0, oo) it follows that 70 = 71- Then we have 

'hgoic)' 



9o(q+i) = 9i{ct+i] 



a.e. in / , 



_Sogi{c)_ 

so that there is a constant D > such that go{ct+i) = Dgi{ct+i) a.e. in /. We can also assume 
that go{c) = Dgi{c) since Cf is continuously distributed. Substituting then gives D = {6i/6o)D, 
implying 61 = 6o- Q.E.D. 
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Previously Chen and Ludvigson (2009) show global identification of (^OiTOifi'o) under differ- 
ent conditions. In their results E[Rt+ijh{ct+i,ct)\w{Zt),ct] is assumed to be complete, which 
is stronger than completeness at Cf = c, but g{c) is not assumed to be bounded or bounded 
away from zero and the support of q need not be [0,oo). 

10.4 A Useful Result on Uniqueness and Existence of Positive Eigenfunc- 



The following result and its proof in part relies on the fundamental results of Krein and Rutman 
(1950), specifically their Theorem 6.2 and example Krein and Rutman (1950) extended the 
Perron-Frobenius theory of positive matrices to the case of operators leaving invariant a cone 
in a Banach space. 

Let / be a Borel subset of M™ and fi he a c-finite measure with support /. Consider the 
space L'^{fi), equipped with the standard norm || • ||. We consider the following conditions on 
the kernel K: 

1. K(s,t) is a non-negative, measurable kernel such that J J K^{s,t)dfi{t)di-t{s) < oo. 

2. K{s, t) = on a set of points {t, s) of measure zero under /i x 
Consider an integral operator L from i^^(/u) to L'^{lj) defined by: 



It is known that these operators are compact under condition 1. The lemma given below shows 
that under these assumptions we have existence and global uniqueness of the positive eigenpair 
(/>, if) such that L(p = pip, in the sense that is stated below. This lemma extends example 
/3' outlined in Krein and Rutman (1950) that looked at the complex Hilbert space L^[a, 6], 
0<a<6<oo, an extension which we were not able to track easily in the literature, so we 
simply derived it; we also provided an additional step (3), not given in the outline, to fully 
verify uniqueness. Note that we removed the complex analysis based arguments, since they are 
not needed here. 



tions 




and its adjoint operator 




[35] 



Lemma A. 4. Under conditions 1 and 2, there exists a unique eigenpair {p,(p), consisting of 
an eigenvalue p and eigenfunction ip such that Lip = pp and p > 0, ||(^|| = 1, > 0; moreover, 
99 > p-a.e. 

Proof. The proof is divided in five steps. 

(1) Let C° be the cone of nonnegative functions in ^ = L?'{p). In the proof we shall use 
the following result on the existence of non-negative eigenpair from Krein and Rutman (1950, 
Theorem 6.2). 

Consider a cone C° in a Banach space A such that the closure of the linear hull of 
C° is A. Consider a linear, compact operator L : A^ A such that LC° C C°, and 
such that there exist v G with H-yH = 1 and 5 > such that Av — 5v G C°. Then 
it has a positive eigenvalue p, not less in modulus than every other eigenvalues, and 
to this eigenvalue there corresponds at least one eigenvector 93 E C° of the operator 
L {Lip = pip) and at least one characteristic vector 7^ of the dual operator L* 
{L*ij = p^). 

The theorem requires that the closure of the linear hull of the cone is A. This is true in our 
case for A = L'^{p) and the cone C° of the non-negative functions in A, since C° — C° is dense 
in A. Moreover, since a = f K{s,t)dp{s)dp{t) > 0, we have that Av > av for v = 1. Therefore, 
application of the theorem quoted above implies that there exists p > and ip and ip s.t. p-a.e. 

ip{s) = p-i J K{s,t)ip{t)dp{t), ip>0, \\ip\\ = 1, p > 0; (10.14) 
V^(s) = I K{t, s)mdp{t), IIVII = 1. (10.15) 

(2) We would like to prove that any eigenvalue p > associated to a nonnegative eigenfunc- 
tion ip > must be a simple eigenvalue, i.e. p is the only eigenfunction in L^{p) associated 
with p. For this purpose we shall use the following standard fact on linear compact operators, 
e.g. stated in Krein and Rutman (1950) and specialized to our context: An eigenvalue p of L 
is simple if and only if the equations Lip = pp and L*ip = pip have no solutions orthogonal to 
each other, i.e. satisfying (p / 0, ^ / 0, / ilj{s)p{s)dp{s) = 0. So for this purpose we will show 
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in steps (4) and (5) below that ij: is of constant sign fi-a.e. and <p and "0 only vanish on a set 
of measure under /x. Since if > 0, this implies 



and we conclude from the quoted fact that p is a simple eigenvalue. 

(3) To assert the uniqueness of the nonnegative eigenpair (p, ip) (meaning that Lip = p^p, 
/9 > 0, 9? > 0, \\ip\\ = 1), suppose to the contrary that there is another nonnegative eigenpair 
(r, Q) . Then r is also an eigenvalue of L* by the Fredholm theorem (Kress, 1999, Theorem 
4.14), which implies by definition of the eigenvalue that there exists a dual eigenfunction i] ^ 
such that L*r] = rij and \\r]\\ = 1. 

By step (4) below we must have ( > 0, ip > p-a.e. Hence by step (5) the dual eigenfunctions 
77 and ^ are non- vanishing and of constant sign p-a.e., which implies f r]{s)(p{s)dp{s) 7^ 0. 
Therefore, r = p follows from the equality: 



(4) Let us prove that any eigenfunction ip > of L associated with an eigenvalue p > 
must be p-a.e. positive. Let S denote the set of zeros of 93. Evidently, p{S) < p{I). If s G 5, 
then 



Therefore K{s,t) vanishes almost everywhere on (s,t) £ S x {I\ S). However the set of zeroes 
of K{s, t) is of measure zero under p x p, so p{S) x p{I \ S) = 0, implying p{S) = 0. 

(5) Here we show that any eigen-triple (p, 93, ijj) solving (|10.15p and (jl0.14p obeys: 




r 




K{t, s)r]{t)dp{t)ip{s)dp{s) = p r]{t)ip{t)dp{t). 




sign(^(s)) = 1 p-a.e. or sign{ip{s)) 



1 p-a.e. 



(10.16) 



From equation (jlO.lSp it follows that p-a.e. 




[37] 



Multiplying both sides by (p{s), integrating and applying (jl0.14p yields 

\^P{s)\^is)dfi{s) < [ [ Kit,s)ip{s)\m\dpit)dpis)= [ \mMi)Mt)- 



It follows that /U-a.e. 

ms)\ = p~'J K{t,s)\mw{t), 

i.e. IV'I is an eigenfunction of L*. 

Next, equation \ip{s)\ = V'(s)sign('(/'(s)) implies that n-a.e. 

I K{t,s)\m\dp{t)=p~' J K{t,s)ij{t)dpmgn{i;{s)). 

It follows that for a.e. (t, s) under ^ x p 

=^(t)sign(V'(s)). 

By the positivity condition on i^T, |^| > p-a.e. by the same reasoning as given in step (4). 
Thus, (flOl^ follows. Q.E.D. 

10.5 Proof of Theorem 13. 

Note that K{c,s) = r{c, 3)3'^° f{s,c)/[f{s)f{c)] > almost everywhere by r(c, s) > and 
/(s, c) > almost everywhere. Therefore the conclusion follows from Lemma A. 4 with /(s)ds = 
dp{s). Q. E. D. 

11 Tangential Cone Conditions 

In this Appendix we discuss some inequalities that are related to identification of ckq . Through- 
out this Appendix we maintain that m(ao) = 0. Define 

M = {a: m{a) / 0}, J\f' = {a : m'{a - ckq) / 0}, 
A/"^ = {a : ||m(a) — m'(a — ao) llg < 77 ||?n''(Q — ao) llg}; 0, 
Mri = {a : ||m (a) — m' (q — ao)||g < ?? (a)llg}, 77 > 0. 

Here N can be interpreted as the identified set and N' as the set where the rank condition 
holds. The set A/"^ is a set on which an inequality version of equation (j2.2p holds. The inequality 
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used to define N-q is similar to the tangential cone condition from tlie literature on computation 
in nonlinear ill-posed inverse problems, e.g. Hanke, M., A. Neubauer, and O. Scherzer (1995) 
and Dunker et. al. (2011). 

The following results gives some relations among these sets: 

Lemma A. 5: For any r] > 0, 
If <r] <1 then 

Proof: Note that a G Mr^ and the triangle inequality gives 

- ||m(a)||g + \\m'{a - ao)||g < r] ||m(a)||g 

so that ||m(a)||g > (1 + r])~^ \\m'{a — ao)|le • Therefore if a G Mrj H M' we have ||m(a)||g > 0, 
i.e. a G Af, giving the first conclusion. Also, if a G we have 

— ||m'(a — ao)||g + II"'-(q^)IIb ^ V \\m'{a — ao)||g 

so that \\m' {a — ao)||g > {l+r])~^ ||m(a)||g . Therefore, if a G M^nM we have ||m'(a — ao)!!^ > 
0, giving the second conclusion. 
Next, for < ?7 < 1, we have 

- \\m'{a - ao)||g < r] ||m(a)||g 

so that ||m'(a — qo)|| > (1 — ?/) ||?^(a)|li3 • Therefore, if a G A/';,nAA we have ||?n'(a — ao)!!;? > 0, 
giving the third conclusion. Similarly, for < < 1 we have 

||m'(Q — ao)||g — lb^('^)llB ^ V ||"^'(a — '^o)||g 

so that ||m(a)||g > (1 — ??) ||m'(Q — ao)||g. Therefore if q G CiM' we have ||W'(a)||g > 0, 
giving the fourth conclusion. Q.E.D. 

The first conclusion shows that when the tangential cone condition is satisfied the set on 
which the rank condition holds is a subset of the identified set. The second condition is less 
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interesting, but does show that the rank condition is necessary for identification when a G M'^. 
The third conclusion shows that the rank condition is also necessary for identification under 
the tangential cone condition for < 77 < 1. The last conclusion shows that when ol G M'^ with 
< < 1 the rank condition is sufficient for identification. 

When the side condition that a G Mr] or a G A/"^ are imposed for < < 1, the rank 
condition is necessary and sufficient for identification. 

Corollary A. 6: // < 77 < 1 then 

Proof: By intersecting both sides of the first conclusion of Lemma A. 5 with Mrj we find that 
Mri n J\f' C Mr] n A/". For ?7 < 1 it follows similary from the third conclusion of Lemma that 

n A/" C M'rf n A/"', implying Mrj H A/"' = Mrj H AA, the first conclusion. The second conclusion 
follows similarly. Q.E.D. 

The equalities in the conclusion of this result show that the rank condition (i.e. a G M') is 
necessary and sufficient for identification (i.e. a G M) under either of the side conditions that 

a G A/"^ or a G A/",,, < r/ < 1. 

In parametric models Rothenberg (1971) showed that when the Jacobian has constant rank in 
a neighborhood of the true parameter the rank condition is necessary and sufficient for local 
identification. These conditions fill an analogous role here, in the sense that when a is restricted 
to either set, the rank condition is necessary and sufficient for identification. 

The sets M-q and A/"^ are related to each other in the way shown in the following result. 

Lemma A. 7. // < ?y < 1 then Mr, C -^^/(i^^) an-d M^ C Nr,/{i-n)- 

Proof: By the triangle inequality 

||?7i'(q! — ao)||g ^ II^T-lo^) ~ ''"'(q^ ~ cto)||g + ||m(Q;)||g , 

||W'(a)||0 < ||m(Q;) — m'(a — ao)||g + ||?7i'(q; — ao)||g • 
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Therefore, for a G Mr 




'{a - ao)||g + V \\m'{a - oq) 



Subtracting from both sides and dividing by 1 — 77 gives a G M' 



. The second conchision 



follows similarly. Q.E.D. 
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